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D scription 

[0001] This patent claims th benefit of U.S. Provisional Application No. 60/060.623. filed October 1. 1997. whose 
contents are incorporated herein by reference. 

Raid of the Invention 

[0002] This invention relates to newly identified polypeptides and polynucleotides encoding such polypeptides, to their 
use in therapy and in identifying compounds which may be agonists, antagonists and/or inhibitors which are potentially 
w useful In therapy, and to production of such polypeptides and polynucleotides. 

Background Of the Invention 

[0003] The drug discovery process is currently undergoing a fundamental revolution as it embraces 'functional 
75 genomics', that is, high throughput genome- or gene-based biology This approach Is rapidly superceding earlier 
approaches based on 'positional cloning*. A phenotype. that is a biological function or genetic disease, would be iden- 
tified and this would then be tracked back to the responsible gene, based on its genetic map position. 
[0004] Functional genomics relies heavily on the various tools of biolntbrmatics to Identify gene sequences of poten- 
tial interest from the many molecular biology databases now available. There is a continuing need to identify and char- 
20 acterise further genes and their related polypeptides/proteins. as targets for drug discovery. 

Summary of the Invention 

[0005] The present invention relates to HTHBZ47. in particular HtHBZ47 polypeptides and HTHBZ47 polynude- 
2S otides. recombinant materials and methods for their production. In another aspect, the invention relates to methods for 
using such polypeptides and polynucleotides, including the treatment of cancer, inflammation, autoimmunity, allergy, 
asthma, rheumatoid arthritis, CNS inflammation, cerebellar degeneration, Alzheimer's disease, Parl^nson's disease, 
mutttple sclerosis, amylotrophic lateral sclerosis, head injury damage, and other neurological abnormalities, septic 
shocK sepsis, stroke, osteoporosis, osteoarthritis, ischemia reperfusion injury, cardiovascular disease, kidney disease. 
30 liver disease, ischemic injury, myocardial infarction, hypotension, hypertension, AIDS, myelodysplastic syndromes and 
other hematologic abnormalities, aplastic anemia, male pattern baldness, and bacterial, fungal, protozoan and viral 
infections, hereinafter refen-ed to as "the Diseases", amongst others. In a further aspect, the invention relates to meth- 
ods for identifying agonists and antagonists/inhibitors using the materials provided by the invention, and treating condi- 
tions associated with HTHB247 imbalance with the identified compounds. In a still further aspect, tiie invention relates 
35 to diagnostic assays for detecting diseases associated with inappropriate HTHBZ47 activity or levels. 

Description of the Invention 

[0006] In a first aspect, the present invention relates to HTHBZ47 polypeptides. Such peptides include Isolated poly- 
40 petides comprising an amino add sequence which has at least 70% identity, preferably at least 80% identity, more pref- 
erably at least 90% Identity, yet more preferably at least 95% Identity, most preferably at least 97-99% Identity, to that of 
SEQ ID N0:2 over the entire length of SEQ ID NO :2. Such polypeptides include those comprising the amino acid of 
SEQ ID N05. . 

[0007] Further peptides of the present invention include isolated polypeptides in which the amirx) acid sequence has 
45 at least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet more preferably at least 
95% identity, most preferably at least 97-99% identity, to tiie amino acid sequence of SEQ ID N0:2 over the entire 
length of SEQ ID N0:2. Such polypeptides include the polypeptide of SEQ ID N0:2. 

[0008] Further peptides of the present invention include isolated polypeptides encoded by a polynucleotide compris- 
ing the sequence contained in SEQ ID NO :1 . 

50 [0009] Polypeptides of the present invention are members of the kringle family of polypeptides. They are therefore of 
interest because kringle domains are found in proteins involved in blood coagulation (thrombin, factor XII). fibrinolysis 
(tissue plasminogen activator, urokinase), liveiregeneration (hepatocyte growth factor), and lipid metabolism (apoiipo- 
protein A). These proteins may have a single kringle domain, as in urokinase, or multiple copies, as in apolipoprotein 
A, which has 38 repeated kringle domains. Kringle motif-containing proteins often have other separate motifs which 

55 perform different functions, such as growth factor and serine protease domains (Campbell, 1. D. et al. Philos. Trans. R. 
Soc. Lon. B. Biol. Sci., 332(1263), p. 165-170, 1991). Kringle domains are thought to aid in binding to fibrin dots or 
membran components, as well as to modulate protease activity in proteins having this activity associated with them 
(Ujnen, H. R. et al. Balli r s Clin. HematoL. 8(2). p. 277-290. 1995). 
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[0010] The proteins, tissue plasminogen activator (t-PA) and urokinase, have both been used dinicall/ in the treat- 
ment of coronary heart disease in both its acute and chronic phases (Gufba, D. C. et al. Ann. Hematbl., 73 Supp1 1 . p. 
S9-27, 1996), and t-PA has be n used clinically in the management ot isch mic stroke (Albers, Q W., West J Med.. 
166(4). p. 253-62, 1994). Both of these proteins aid in the course of the disease by binding to fibrin clots and activating 

£ plasminogen, which degrades the clot by proteolysis. Other kringls-containtng proteins, such as hepatocyte growth lac- 
tor, have been implicated in metastatic cancers, presumably via growth factor domains associated with these proteins 
(Weidner. K. M. et al, EXS. 65. p. 3 1 1 -28. 1993). These properties are hereinafter refened to as "HTHBZ47 activity" or 
"HTHBZ47 polypeptide activity" or "biological activity of HTHBZ47". Also included amongst these activities are anti- 
genic and Immunogenic activities of said HTHBZ47 polypeptides, in particular the antigenic and immunogenic activities 

10 of the polypeptide of SEQ ID NO:2. Preferably, a polypeptide of the present invention exhibits at least one biological 
activity of HTHBZ47. 

[001 1 ] The polypeptides of the present invention may be In the form of the "mature" protein or nriay be a part of a larger 
protein such as a fusion protein. It Is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification such as multiple histidine residues. 

15 or an additional sequence for stability during recombinant produaion. 

[0012] The present invention also Includes include variarrts of the aforementioned polypetides, that is polypeptides 
that vary from the referents by conservative amino acid substitutions, whereby a residue is substituted by another with 
like characteristics. Typical such substitutions are among Ala, Val, Leu and lie; among Ser and Thr; among the acidic 
residues Asp and GSu; among Asn arid Gin; and among the basic residues Lys and Arg; or aromatic residues Phe and 

20 Tyr. Particularly preferred are variants in which several, 5-10, 1-5, 1-3. 1-2 or 1 amino acids are substituted, deleted, or 
added in any combination. • ' 

[0013] Polypeptides of the present Invention can be prepared in any suiteOole manner. Such polypeptides include iso- 
lated naturally occurring polypeptides, recomblnantly produced polypeptides, synthetically produced polypeptides, or 
polypeptides produced by a combination of these methods. Means for preparing such polypeptides are well understood 

25 in the art. i • , 

[0014] In a further aspect, the present invention relates to HTHBZ47 polynucleotides. Such polynucleotides include 
isolated polynucleotides comprising a nucleotide sequence encoding a polypeptide which has at least 70% identity, 
preferably at least 80% identity, more preferably at least 90% identity, yet more preferably at least 95% identity, to the 
amino add sequence of SEQ ID NO:2, over the entire length of SEQ ID N0:2. In this regard, polypeptides which have 

30 at least 97% identity are highly preferred, whilst those with at least 98-99% identity are more highly preferred, and those 
with at least 99% identity are most highly preferred. Such polynucleotides include a polynucleotide con^rising the 
nucleotide sequence contained in SEQ ID N0:1 encoding the polypeptide of SEQ ID N0:2. 
[0015] Further polynucleotides of the present invention include isolated polynucleotides comprising a nucleotide 
sequence that has at least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet more 

35 preferably at least 95% identity, to a nucleotide sequence encoding a polypeptide of SEQ ID N0:2, over the entire cod- 
ing region. In this regard, polynucleotides which have at least 97% identity are highly preferred, whilst those with at least 
98-99% identity are more Nghly preferred, and those with at least 99% identity are most highly prefen'ed. 
[0016] Further polynucleotides of the present invention include isolated polynucleotides comprising a nucleotide 
sequence which has at least 70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet 

40 more preferably at least 95% identity, to SEQ ID N0:1 over the entire length of SEQ ID N0:1. In this regard, polynude- 
otides which have at least 97% identity are highly prefen'ed, whilst those with at least 98-99% identiy are more highly 
preferred, and those with at least 99% Identity are most highly preferred. Such polynucleotides include a polynucleotide 
comprising the polynucleotide of SEQ ID N10:1 as well as the polynucleotide of SEO ID N0:1 . 
[0017] The invention also provides polynucleotides which are conrplementary to all the above described polynude- 

46 otides. 

[0018] The nudeotide sequence of SEQ ID N0:1 shows homology with human tissue-type plasminogen activatoi' 
(Harris, T, J. et al., Mol. Biol. Med. 3(3) : 279-292, 1986), Accession # M15518. The nucleotide sequence of SEQ ID 
N0:1 is a cDNA sequence and comprises a polypeptide encoding sequence (nucleotide 87 to 1463) encoding a 
polypeptide of 45i3 amino acids, the polypeptide of SEQ ID N0:2. The nucleotide sequence encoding the polypeptide 

50 of SEQ ID N0:2 may be identical to the polypeptide encoding sequence contained in SEQ ID N0:1 or it may be a 
sequence other than the one contained in SEQ ID N0:1, which, as a result of the redundancy (degeneracy) of the 
genetic code, also encodes the polypeptide of SEQ ID N0:2. The polypeptide of SEQ ID N0:2 is structurally related to 
other proteins of the kringle family, having homology and/or structural similarity with tissue plasminogen activator pre- 
cursor (tPA), (Harris et al., Mol. Biol. Med. 3(3) : 279-292, 1986. QenBank accession # 137119). 

55 [0019] Preferred polypeptides and polynucleotides of the present invention are expected to have, inter alia, similar 
biological functions/properties to their homologous polypeptides and polynucleotides. Furthermore, preferred polypep- 
tides and polynucleotides of th present inv ntion have at least one HtHBZ47 activity. 

[0020] The present invention also relates to partial or other polynudeotide and polypeptide sequences which were 
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first identified prior to the determination of the con-esponding full length.sequences of SEQ ID N0:1 and SEQ ID N0:2, 
[0021] Accordngty, in a further, aspect, the present invention provides for an isolated polynucleotide conprising; 

(a) a nucleotide sequence which has at least 70% identity, preferably at least 80% identity, more preferably at least 
90% identity, yet more preferably at least 95% identity, even more preferably at least 97-99% identity to SEC ID 
N0:3 over the entire length of SEQ ID NG:3; ' 

(b) a nucleotide sequence which has at least 70% identty, preferably at least 80% identity, more preferably at least 
90% identity, yet more preferably at least 95% identity, even more preferably at least 97-99% identity, to SEQ ID 
N0:1 over the entire length of SEQ ID N0:3: 

(c) the polynucleotide of SEQ ID NO :3; or 

(d) a nucleotide sequence encoding a polypeptide which has at least 70% identity, preferably at least 80% identity, 
more preferably at I6ast9b% identity, yet more prelerably .at least 95% Identity, even more preferably at least 97- 
99% identity, to the amino acid sequence of S>EQ ID N0:4, over the entire length of SEQ ID N0:4; 

as well as the polynucleotide of SEQ ID N0:3. 

[0022] The present invention further provides for a polypeptide which: 

(a) comprises an amino acid sequence which has at least 70% identity, ipreferably at least 80% identity, more pref- 
erably at least 90% identity, yet more preferably at least 95% identity, most preferably at least 97-99% identity, to 
that of SEQ ID N0:2 over the entire length of SEQ ID N0:4: 

(b) has an amino acid sequence which Is at least 70% identity, preferably at least 80% identity, more preferably at 
least 90% identity, yet more preferat)ly at least 95% identity, most preferably at least 97-99% identity, to the amino 
acid sequence of SEQ ID N05 over the entire length of SEQ ID N0:4; 

(c) comprises the amino acid of SEQ ID N0:4; and 

(d) is the polypeptide of SEQ ID NO :4; . 

as well as polypeptides encoded by a polynudeotide comprtsing the sequence contained in SEQ ID N0:3. 

[0023] The nucleotide sequence of SEQ ID N0:3 and the peptide sequence encoded thereby are derived from EST 
(Expressed Sequence Tag) sequences. It is recognised by those skilled in the art that there will inevitably be some 
nucleotide sequence reading errors in EST sequences (see Adams, M.D. a/,' Nature 377 (supp) 3, 1995). Accord- 
ingly, the nucleotide sequence of SEQ ID N0:3 and the peptide sequence encoded therefrom are therefore subject to 
the same inherent limitations in sequence accuracy. Furthermore, the peptide sequence encoded by SEQ ID N0.-3 
comprises a region of identity or dose homology and/or close structural similarity (for example a conservative amino 
acid difference) with the closest homologous or structurally similar protein. 

[0024] Polynucleotides of the present invention may be obtained, using standard doning and screening techniques, . 
from a cDNA library derived from rnRNA In cells of human thynujs and fetal liver, using the expressed sequence tag 
(EST) analysis (Adams. M.D.. etal. Science (1991) 252:1651-1656; Adams. M.D. etaL, Nature, (1992) 355:632-634; 
Adams, M.D., et a/.. Nature {^995) 377 Supp3-1 74). Polynucleotides of the invention can also be obtained from natural 
sources such as genomic DNA libraries or can be synthesized using well known and commerdally available techniques. 
[0025] When pdynudebtides of the present invention are used for the recombinant production of polypeptides of the 
present invention, the polynucleotide may Include the coding sequence for the mature polypeptide, by itself; or the cod- 
ing sequence for the mature polypeptide in reading frame with other coding sequences, such as those encoding a 
leader or secretory sequence, a pre-, or pro- or prepro-protein sequence, or other ftision peptide portions. For example, 
a marker sequence which facilitates purification of the fused polypeptide can be encoded. In certain prefenred embod- 
iments of this aspect of the invention, the marker sequence is a hexa-histidine peptide, as provided in the pQE vector 
(Qiagen. Inc.) and described in Gentz et ai. Proc Nati Acad Sci USA (1 989) 86:821 -824. or is an HA tag. The polynu- 
cleotide may also'cbnlain non-coding 5' and 3' sequences, such as transcribed, non-translated sequences, splicing and 
polyat^enylation signals, ribosome binding sites and sequences that stabilize mRNA. 

[0026] Further embodiments of the present inventiori Indude polynucleotides encoding polypeptide variants which 
comprise the amino acid sequence of SEQ ID N0:2 and in which several, for instance from 5 to 10. 1 to 5, 1 to 3. 1 to 
2 or 1. aniiho add residues are substituted, deleted or adjded. in any conibination. 

[0027] Polynucleotides which are identical or suffidently identical to a nucleotide sequence contained in SEQ ID 
N0:1, may be used as hybridization probes for cDNA and genomic DNA or as primers for a nudeic add amplification 
(PGR) reaction, to isolate full-length cDNAs and genomic clones encoding polypeptides of the present invention and to 
isolate cDNA and genomic dpnes of other genes (including genes encoding hpmblogs and orthologs from species 
other than human) that have a high sequence similarity to SEQ ID NO :1 . typically these nudeotide sequences are 70% 
id ntlcal. preferably 80% identical, more preferably 90% identical, most preferably 95% identical to that of the refei'ent. 
The probes or primers yvill generally comprise at least 15 nucleotides, preferably, at least 30 nudeotides and nray have 
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* at least 50 nucleoticles. Particutarly preferred probes will have between 30 and 50 nucleotides. 
[0028] A polyrRJclecTtide encoding a polypeptide of the present invention, including honnologs and orthologs from spe- 
cies other than hunnan, may be obtained by a pr cess which comprises the steps of screening an appropriate library 
under stringent hytxidization conditions with a labeled probe having the sequ nee of SEQ ID NO: 1 or a fragment 

5 thereof, and isolating full-length cDNA and genomic dones containing said polynucleotide sequence Such hybridiza- 
tion techniques are well known to the skilled artisan. Preferred stringent hybridization conditions include overnight incu- 
bation at 42''C in a solution comprising: 50% formamide, SxSSC (150mM NaGI, 15mM trisodium citrate), 50 mM sodium 
phosphate (pH7.6). 5x Denhardt's solution. 10 % dextran sulfate, and 20 microgrann/ml denatured, sheared salnrwn 
sperm DNA; followed by washing the filters in 0.1x SSC at about 65*C. Thus the present invention also includes poly- 

10 nucleotides obtainable by screening an appropriate library under stingent hybridization conditions with a labeled probe 
having the sequence of SEQ ID NO: 1 or a fragment thereof. 

[0029] The skilled artisan will appreciate that, in many cases, an isolated cDNA sequence will be incomplete, in that 
the region coding for the polypeptide is cut short at the 5* end of the cDNA. This is a consequence of reverse tran- 
scriptase, an enzyme with inherently low 'processivity' (a measure of the ability of the enzyme to remain attached to the 
75 . template during the polymerisation reaction), failing to complete a DNA copy of the mRNA template during 1 si strand 
cDNA syntheisis. 

[0030] there are several methods available and well known to those skilled in the art to obtain full-length cDNAs, or 
extend short cDNAs, for example those based on the method of Rapid Amplification of cDNA ends (RACE) (see, for 
example, Frohman et al., PNAS USA 85; 8998-9002, 1988). Recent modifications of the technique, exemplified by the 

20 Marathon™' technology (Clontech Laboratories Inc.) for example, have significantly simplified the search for longer 
cDNAs. In the Marathon™ technology, cDNAs have been prepared from mRNA extracted from a chosen jissue and an 
'adaptor* sequence ligated onto each end. Nucleic acid amplification (PGR) is then carried out to amplify the 'missing' 
5' end of the cDNA using a oonnbination of gene specific and adaptor specific oligonucleotide primers. The PGR reac- 
tion is then repeated using 'nested' primers/that is. primers designed to anneal within the amplified product (typically 

25 an adaptor specific primer that anneals further 3* in the adaptor sequence and a gene specific primer that anneals fur- 
ther 5' In the known gene sequence). The products of this reaction can then be analysed by DNA sequencing and a full- 
length cD NA constructed either by joining the product directly to the existing cDNA to give a complete sequence, or car- 
rying out a separate full-length PGR using the new sequence information for the design of the 5' primer. 
[0031 ] Recont^inant polypeptides of the present invention may be prepared by processes well known in the art from 

30 genetically engineered host cells comprising expression systems. Accordingly, in a further aspect, the present invention 
relates to expression systems which comprise a polynudeotide or polynucleotides of the present invention, to host cells 
which are genetically engineered with such expression sytems and to the production of polypeptides of the invention by 
recombiniant techniques. Gell-free translation systems can also be employed to produce such proteins using RNAs 
derived from the DNA constructs of the present invention. 

35 [0032] For reconi)inant production, host cells can be genetically engineered to incorporate expression systems or 
portions thereof for polynucleotides of the present invention. Introduction of polynucleotides into host cells can be 
effected by methods described in many standard laboratory manuals, such as Davis et al.. Basic Methods in Molecular 
Biology (1 986) and Sambrook et al.. Molecular Goning: A Laboratory Manual. 2nd Ed., Gold Spring Harbor Laboratory 
Press. Cold Spring Hartx5r. N.Y. (1989). Preferred such methods include, for instance, calcium phosphate transfection, 

40 DEAE-dextran mediated transfection, transvection, microinjection, cationic lipid-mediated transfection. electroporation, 
transduction, scrape loading, ballistic introduction or infection. 

[0033] Representative exaniples of appropriate hosts include bacterial cells, such as streptococci, staphylococci, E. 
coli, Sfreptomyces and Baciflus subtilis cells; fungal cells, such as yeast cells and Aspergillus cells; insect cells such 
as Drosophila S2 and Spodoptera SfS cells; animal ceOs such as GHO, COS, HeLa, CI 27, 3T3, BHK, HEK293 and 

46 Bowes melanoma celts; and plant cells. 

[0034] A great variety of expression systems can be used, for instance, chromosomal, episomal and virus-derived 
systems, e.g.. vectors derived from bacterial plasmids, from bacteriophage, from transposons, from yeast epispmes, 
from insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such 
as SV40. vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived 

50 from combinations tiiereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids 
and phagemids. The expression systems may contain control regions that regulate as welt as engender expression. 
Generally, any system or vector which is able to maintain, propagate or express a polynucleotide to produce a polypep- 
tide in a host may he used. The appropriate nucleotide sequence may be ins^ed into an expression system by any of 
a variety of well-toiown and routine techniques, such as, for example, those set fortii in Sambrook ef a/.. MOLECULAR 

55 CLONING, A LABORATORY MANUAL (supra). Appropriate secretion signals may be incorporated into tiie desired 
polypeptide to allow secretion of the f anslated protein into tiie lumen of the endoplasmic reticulum, the periptasmic 
space or the extracellular environment. These signals may be endogenous to th polypeptide or they may be heterolo- 
gous signals. 
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[00351 ^ a polypeptide of the present i nvention is to be expressed for lise in screening assays, rt is generally preferred 
that the polypeptide be produced at the surface of the ceil. In this event the ceils may be harvested prior to us in the 
screening assay. If the polypeptide is secreted into the medium, the medium can be recovered In order to recover and 
purify the polypeptide. Iff produced irrtracellularly, the cells must first be lysed before the polypeptide is recovered. 

5 (0036J Polypeptides of the present inv ntion can be recovered and purified from recombinant cell cultures by well- 
known methods including anmonium sulfate or ethanol precipitation; add exlractioh. anion or cation exchange chroma- 
tography, phosphocellulose chromatograjDhy, hyilrdphdbic interaction chromatography, affinity chromatography, hydrox- 
' ylapatite chromatography and lectin chromatography. Most preferably, high perfomnance liquid chromatography is 
. : employed for purification. Well known techniques for refolding proteins may be employed to regenerate active confor- 

10 nration when the polypeptide is denatured during isolation and or purification. 

[0037] This invention also relates to the use of polynucleotides of the present invention as diagnostic reagents. Detec- 
tion of a mutated form of the gene chiaracterised by the polynucleotide of SEQ ID N0:1 which is associated with a dys- 
function will provide a diagnostic tool that can add to, or define, a diagnosis of a disease, or susceptibility to a disease, 
which results from under-expression. over-expression or altered expression of the gene. Individuals carrying mutations 

rs inthegenemaybedetectedatthe DN A level by a variety of techniques. 

[00381 Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood, urine, saliva, tissue 
biopsy or autopsy material. The genomic DNA may be used directly for detection or may be amplified enzymatically by ' 
using PGR or other amplification techniques prior to analysis. RNA or cDNA may also be used In similar fashion. Dele- 
tions and insertions can be detected by a change in size of the anplrf led product in cornparison to the norra^ 

20 Point mutations can be identified by hytjridizing amplified DNA to labeled HTHBZ47 nucleotide sequences. Perfectiy 
matched sequences can be distinguished from mismatched duplexes by RNase digestion or by differerices in melting 
temperatures. DNA sequence differences may also he detected by alterations in electrophoretic mobility of DNA frag- 
. ments in gels, with or without denaturing agents, or by. direct DNA sequencing (ee, e.g.\ Myers et aL, Science {^98S) 
230:1242). Sequence changes at specific locations may also be revealed by nuclease protection assays, such as 

25 RNase and S1 protection or the chemical cleavage method (see Cotton a/., Proc Nat! Acad Set USA (1 985) 85: 4397- 
4401). In another embodiment, an array of oligonucleotides probes comprising HTHBZ47 nucleotide sequence or frag- 
ments thereof can be constructed to conduct efficient screening of e.g., genetic mutations. Array technology niethods 
; are well known and have general applicability and can be used to address a variety of questions in molecular genetics 
including gene expression, genetic linkage, and genetic variability (see for example: M.Chee et al., Science. Vol 274. 

30 pp 610-613 (1996)). 

[00391 The diagnostic assays offer a prpcess for diagnosing or determining a susceptibility to the Diseases through 
detection of mutation in the HTHBZ47 gene by the methods described. In addition, such diseases may be diagnosed 
by methods comprising determining from a sample derived from a subject an abnormally decreased or increased level 
of polypeptide or mRNA. Decreased or increased expression can be measured at the RNA level using any of the meth- 

35 ods well known in the art for the quantitation of polynucleotides, such as, for example, nucleic add amplification, for 
instance PGR, RT-PCR, RNase protection. Northern blotting and other hybridization methods. Assay techniques that 
can be used to determine levels of a protein, such as a polypeptide of the present invention, in a sample derived from 
a host are well-known to those of skill intheart. Such assay methods Include radioimmunoassays, competitive-binding 
assays, Western Blot analysis and ELISA assays. 

40 [0040] Thus in another aspect, the present invention relates to a diagonostic kit which comprises: ' 

(a) a polyriudeotide of the present Invention, preferably the nucleotide sequence of SEQ ID NO: 1, or a fragment 
■ thereof; ' . . > , 

(b) a nucleotide sequence complementary to that of (a); 

46 (c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID N0:2 or a fragment thereof; or 
(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of SEQ ID N0:2. 

[00411 It will be appreciated that in any such Wt, (a), (b), (c) or (d) may comprise a substantial component. Such a kit ' 
will be of use in diagnosing a disease or suspectabllity to a disease, particularly cancer, inflammation, autoimmunity, 

so allergy, asthma, rheumatoid artiiritis, CNS inflammation, cerebellar degeneration, Alzheimer's disease, Parkinson's dis- 
ease, multiple sderosis, amyiotrophic lateral sclerosis, head injury damage, and other neurdogical abnonmalities, sep- 
tic shock, sepsis, stroke, osteoporosis. osteoarthVitis, ischemia repertusion injury, cardiovascular disease, kidney 
disease, liver disease, ischemic injury, myocardial infarction, hypotension, hypertension, AIDS, myelodysplastic syn- 
dromes and other hematologic abnormal iti es, aplastic anemia, male pattern baldness, and bacterial, fungal . protozoan 

55 and yiral infections, amongst others. 

[0042] The nudeotide sequences of the present invention are also valuable for chromosome identification. The 
sequence is specif ically targeted to;; and can hybricfize with, a particular location on an individual human chromosome. 
.The mapping of relevant sequences to chrpnridsomes according t the present invention is an important first step in cor- 
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relating those sequences with gene associated disease. Once a sequence has been mapped to a precise chromosomal 
location, th physical position of the sequence on the chronrtosome can he correlated with genetic map data. Such data 
are found in, for exampl , V. McKusicK Mendelian Inheritance in Man (available on-line through Johns Hopkins Uriiver- 
sity Welch Medical Library). The relationship between genes and diseases that have been mapped to the same chro- 
5 mosomal region are then identified through linkage analysis (colnheritance of physically adjacerit genes). 

[0043] The differences in the cDNA or genorriic sequence between affected and unaffected individuals can also be 
determined. If a mutation is observed in sonne or all of the affected individuals but not in any nonnal individuals, then 
the mutation is likely to be the causative agent of the disease. 

[0044] The polypeptides of the invention or their fragments or analogs thereof, or cells e)0ressing them, can also be 
10 used as ihrimunogens to produce antibodies imrnunospecif ic for polypeptides of the present invention. The term Immu- 
nospecific" means that the antibodies have substantially greater affinity for the polypeptides of the invention than their 
affinity for other related polypeptides in the prior art ' 

[0045] Antibodies generated against polypeptdes of the present invention may be obtained by administering the 
polypeptides or epitope-t)earlng fragm»its, analogs or cells to an animal, preferably a non-human animal, using routine 
15 protocols. For preparation of monoclonal antibodies, any technique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the hybridoma technique (Kohier, G. and Milstein. C, Nature (1975) 
256:495-497). thetrioma technique, the human B-cell hybridoma technique (Kozbor'ef a/.. Immunology Today (1983) 
4:72) and the EBV-hybridoma technique (Cole et at., MONOCLONAL ANTIBODIES AND CANCER THERAPY, pp. 77- 
96, Alan R. Liss, Inc., 1985). 

20 [0046] Techniques for the production of single chain antbodies, such as those described in U.S. Patent No. 4.946,778. 
can also be adapted to produce single chain antibodies to polypeptides of this invention. Also, transgenic mice, or other 
organisms, induding other mammals, may be used to express humanized antibodies. 

[0047] The above-desaibed antibodies may be employed to isolate or to identify clones expressing the polypeptide 
or to purify the polypeptides by affinity chromatography. 
25 [0048] Antibodies against polypeptides of the present invention may also be employed to treat the Diseases, amongst 
others. 

[0049] In a further aspect, the present Invention relates to genetically engineered soluble fusion proteins comprising 
a polypeptide of the present invention, or a fragment thereof, and various portions of the constant regions of heavy or 
tight chains of immunoglobulins of various subclasses (IgG. IgM. IgA. IgE). Preferred as an immunoglobulin is the con- 

30 stant part of the heavy chain of human IgG, particularly IgGi , where fusion takes place at the hinge region. In a partic- 
ular embodiment, the Fc part can be removed simply by incorporation of a cleavage sequence which can be cleaved 
with blood clotting factor Xa. Furthermore, this Invention relate to processes for the preparation of these fusion proteins 
by genetic engineering, and to the use thereof for drug screening, diagnosis and therapy. Afurther aspect of the inven- 
tion also relates to polynucleotides encoding such fusion proteins. Exanples of fusion protein technology can be found 

35 in International Patent Application Nos. W094/2945d and W094y2291 4. 

[0050] Another aspect of the invention relates to a method for inducing an Immunological response in a mammal 
which comprises inoculating the mammal with a polypeptide of the present invention, adequate to produce antibody 
and/or T cell immune response to protect said animal from the Diseases hereinbefore mentioned, amongst others. Yet 
another aspect of the invention relates to a method of inducing immunological respoise in a mammal which comprises, 

40 delivering a polypeptide of the present invention via a vector directing expression of the polynucleotide and coding for 
the polypeptide in wVo in order to induce such an immunological response to produce antibody to protect said animal 
from diseases. 

[0051] A furtiier aspect of the invention relates to an immunological/vaccina formulation (conpositionj which, when 
introduced into a mammalian host, induces an immunological response in that manimal to a polypeptide of the present 

4S invention wherein tiie con^sbsition comprisee a polypeptide or polynucleotide of the present invention. The vaccine for- 
mulation may further comprise a suitable carrier. Since a polypeptide may be broken down in the stomach, it is prefer- 
ably administered parenterally (for instance, sutxxitaneous. intramuscular, intraverious, or intradermal injection). 
Formulations suitable for parenteral administration include aqueous and non-aqueous sterile injection solutions which 
may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation instonic with the blood of the 

50 recipient; and aqueous and non-aqueous sterile suspensions which may include suspending agents or tiiickening 
agents. The formulations may be presented in unit-dose or multi-dose containers, for example, sealed ampoules and 
vials and may be stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier immediately 
prior to use. The vaccine formulation may also include adjuvant systems for enhancing the immunogenicrty of the for- 

\ . mulation, such as oil-in water systems and other systems known in the art. The dosage will depend on the specif ic activ- 

55 ity of the vaccine and can be readily determined by routine experimentation. 

[0052] Polypeptides of the present invention are responsible for many biological functions, iriduding many disease 
states, in particular the Diseases hereinbefor mentioned. It is therefore desirous to devise scr entng methods to iden- 
tify compounds which stimulate or which inhibit the function of the polypeptide. Accordingly, in a further asp ct, the 
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present invention provides for a method of sere ning compounds to identify those which stimulate or which inhibit the 
function of the polypeptide. In general, agonists or antagonists nriay be employed for therapeutic and prophylactic pur- 
poses for such Diseases as hereinbefore mentioned. Compounds may be identified from a variety of sources, for exam- 
ple, cells, cell-free preparations, chemical libraries, and natural product mixtures. Such agonists; antagonists or 
5 inhibitors so-identified may be natural or modified substrates, ligands, receptors, enzymes, etc.. as the case may be. of 
the polypeptide; or may be structural or functional mimetics thereof (see Coligan et aL, Current Protocols in 
/mmuno/pgy 1(2) :Chapter 5 (1991)). 

[0053] The screening method may simply measure the binding of a candidate compound to the polypeptide, or to cells 
or membranes bearing the polypeptide, or a fusion protein thereof by mearts of a label directly or indirectly associated 

w . with the candidate compound. Alternatively, the screening method may involve conrpetition with a labeled conpetitor. 
Further, these screening methods may test whether the candidate compound results in a signal generated by activation 
or inhibition of the polypeptide, using detection systems appropriate to the cells bearing the polypeptide. Inhibitors of 
activation are generally assayed in the presence of a known agonist and the effect on activation by the agonist by the 
presence of the candidate conpound is observed. Constitutively active polpypeptides may be employed in screening 

IS methods for inverse agonists or inhibitors, in the absence of an agonist or inhibitor, by testing whether the candidate 
compound results in inhibition of activation of the polypeptide. Further, the screening methods may simply conprise the 
steps of mixing a candidate connpound with a solution containing a polypeptide of the present invention, to form a mix- 
ture, measuring HTHBZ47 activity in the mixture, and comparing the HTHBZ47 activity of the mixture to a standard. 
Fusion proteins, such as those made from Fc portion and HTHBZ47 polypeptide, as hereinbefore described, can also 

20 be used for high-throughput screening assays to identify antagonists for the polypeptide of the present invention (see 
D. Bennett et al., J Mol Recognition, 8:52-58 (1995); and K, Johanson etal.,4 Biol Chem, 270(16):9459-947l (1995)). 
[0054] The polynucleotides, polypeptides and antftsodies to the polypeptide of the present invention may also be used 
to configure screening methods for detecting the effect of added compounds on the production of mRNA and polypep- 
tide in cells. For example, an ELISA assay may be constructed for measuring secreted or cell associated levels of 

25 polypeptide using monoclonal and polyclonal antibodies by standard merthods known in the art. This can be used to dis- 
cover agents which may inhibit or enhance the production of polypeptide (also called antagonist or agonist, respec- 
tively) from suitably manipulated cells or tissues. 

[0055] The polypeptide nr^y be used to identify membrane bound or solubl e receptors, if any, through standard recep- 
tor binding techniques known in the art. These include, but are not limited to, ligand binding and crosslinking assays in 

50 which the polypeptide is labeled with a radioactive isotope (for instance, ^^^1), chemically modified (for instance, bioti- 
nyiated), or fused to a peptde sequence suitable for detection or purification, and incubated with a source of the puta^ 
tive receptor (cells, cell membranes, ceil supernatants, tissue extracte, bodily f liids). Other methods include biophysical 
techniques such as surface plasnron resonance and spectroscopy. These screening methods may also be used to iden- 
tify agonists and antagonists of the polypeptide which cornpete with the binding of the polypeptide to its receptors, if 

35 any. Standard methods for conducting such assays are well understood in the art 

[0056] Exanpjes of potential polypeptide arrtagonists include antibodies or, in some cases, oligonucleotides or pro- 
teins which are closely related to the ligands. substrates, receptors, enzymes, etc., as the case may be, of the polypep- 
tide, e.g., a fragment of the ligands, substrates, receptors, enzymes, etc.; or small molecules which bind to the 
polypetide of the present invention but do not elicit a response, so that the activity of the polypeptide is prwented, 

40 [0057] Thus, in another aspect, the present invention relates to a screening kit for identifying agonists, antagonists, 
ligands, receptors, sii>strates, enzymes, etc. for polypeptides of the present invention; or compounds which decrease 
or enhance the production of such polypeptides, which comprises: 

(a) a polypeptide of the present invention; 
45 (b) a recombinant cell expressing a polypeptide of the present invention; 

(c) a cell membrane expressing a polypeptide of the present invention; or 

(d) antitxxly to a polypeptide of the present invention; 
which polypeptide is preferably that of SEQ ID N0:2. 

50 [0058] It will be appreciated that in any such kit. (a), (b), (c) or (d) may comprise a substantial component. 

[0059] It will be readily appreciated by the skilled artisan that a polypeptide of the present invention may also be used 
In a method for the structure-based design of an agonist, antagonist or inhibitor of the polypeptide, by: 

(a) determining in the first instance the three-dimensional structure of the polypeptide; 
55 (b) deducing the three-dimensional structure for the likely reactive or binding site(s) of an agonist, antagonist or 
inhibitor; 

(c) synthesing candidat conpounds that are predicted to bind to or react with tiie deduced binding or reactive site; 
and 
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(d) testing whether the candidate connpounds are indeed agonists, antagonists or inhibitors. 

It will be further appreciated that this will nornially be an interative proc^^^ . 
' [0060] In a further aspect, the present invention provides methods of treating abnornial conditions such as, for 

5 instance, cancer, inflanmmation, autoimmunity, allergy, asthma, rheumatoid arthritis. CNS inflammation, cerebellar 
. degeneration. Alzheimer's disease, Parkinson's disease, multiple sclerosis, amylotrophic lateral sclerosis, head injury 
damage, and other neurological abnormalities, septic shock, sepsis, stroke, osteoporosis, osteoarthritis, ischernia 
reperfusion injury, cardiovascular disease, kidney disease, liver disease, ischemic injury, myocardial infarction, hypoten- 
sion, hypertension. AIDS, myelodysplasttc syndromes and other hematologic abnorrnaiities, aplastic anemia, male pat- 

10 tern baldness, and bacterial, fungal, protozoan and viral infections, related to either an excess of, or an under- 
. expression of. I-ITHBZ47 polypeptide activity. . . 

[0061] If the activity of the polypeptide is in excess, several approaches are available. One approach comprises 
administering to a subject In need thereof an inhibitor compound (antagonist) as hereinabove described, optionally in 
combination with a pharmaceutically acceptable carrier, in an amount effective to inhibit the function of the polypeptide, 

15 such as, for ^cample, by blocking the binding of ligands. substrates, receptors, enzymes, etc.; or by inhibiting a second 
signal, and thereby alleviating the abnormal condition. In another approach, soluble fbrnis of the polypeptides still capa- 
ble of binding the iigand, substrate, enzymes, receptors, etc. In competitioh with endogstfious polypeptide may be 
administered. Typical examples of such competitors include fragments of the HTHBZ4^ polypeptide. ' 
[0062] In still another approach, expression of^e gene encoding endogenous l-rn-IBZ47 polypeptide can be inhibited 

20 using expression blocking tediniques! Known such techniques involve the use of anti sense sequences, either internally 
generated or separately administered' (see, for example. O'Connor, J Neurochem (1991) 56:560 in Oligodeoxynude- 
otides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton. FL (1988)). Alternatively, oligonucleotides 
which fonn triple helices with the' gene can be supplied (see for example. Lee et al.. Nucleic Acids Res (1 979) 6:3073; 
Cponey et af.. Science (1988) 241:456; Dervan a/.. Science (1991) 251:1360). These oligomers can be adminis- 

25 tered perse or the relevant oligomers can be expressed /n wVo.- 

[0063] : For treating abnormal conditions related to an under-expression of HTHBZ47 and Its activity, several 
approaches are also available. One approach cbrriprises administering to a subject a therapeutically effective amount . 
of a compound which activates a polypeptide of the present invention, i.e., an agonist as described above, in combina- 
/ tion with a pharmaceutically acceptable carrier, to thereby alleviate the abnormal condition. Alternatively, gene therapy 

30 may he employed to effect the endogenous production of HTHBZ47 by the relevant cells in the subject For example, a 
polynucleotide of the invention may be engineered for expression in a replication defective retroviral vector, as dis- 
cussed above. The retroviral expression construct may then be isolated and introduced into a packaging cell trans- 
duced with a retroviral plasmid vector containing RNA encoding a polypeptide of the present invention such that the 
packaging cell now prodiices jnfectious viral particles containing the gene of interest These producer cells may be 

35 administered to a subject for engineering cells in vivo and expression of the polypeptide in vivo. For an overview of 
gene therapy, see Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, (and refer- 
ences cited therein) in Human Molecular Genetics, T Strachan and A P Read, BIOS Scientific Publishers Ltd (1995). 
Another approach is to administer a therapeutic amount of a polypeptide of the present invention in combination with a 
suitable pharmaceutical carrier. ' 

40 [0064] In a further aspect, the present invention provides for pharmaceutical cxsmpositions comprising a therapeuti- 
cally effective arnount of a fX)lypeptide, such as the solii)!^ form of a polypeptide of the present invention, ago- 
nist/antagonist peptide or small molecule cornpound, in combination with a pharmaceutically acceptable carrier or 
excipienl Such carriers include, but are not limited to, saline, buffered saline, dextrose, water. . glycerol, ethanol, and 
combinations th erect The invention further relates. to pharmaceutical pac^s and kits comprising one or more containers 

46 filled with one or more of the ingredients of tine aforementioned compositions of tiie invention. Polypeptides and other 
connpounds of the present invention may be employed alone or in conjunction with other compounds, such as therapeu- 
tic compounds. . 

[0065] The composition will be adapted to the route of administration, for instance by a systemic or an oral route: Pre- 
ferred forms of systemic administration include injection, typically by intravenous injection. Otiier injection routes, such 

50- as subcutaneous, intramuscular, or intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdermal administration using penetrants such as bile salts or fusidic acids or other detergents, in 
addition, if a polypeptide or other compounds of the present invention can be formulated in an enteric or an encapsu- 
lated formulation, oral administration may also be possible. Administration of tiiese compounds may also be topical 
and/or localized, in the form of salves, pastes, gels, and the like. 

55 [0066] The dosage range required depends on tiie choice of peptide or other conpounds of the present invention, 
the route of administration, the nature of the formulation, the nature of the subject's condition, and the judgmeint of the 
attending practitioner. Suitable dosages, however, are in the rarige erf 0.M00|ig/kg of subject. Wide variations in tiie 
needed dosage, frowever, are to be expected in view of the variety of compounds available and the differing efficiencies 
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of various routes of administration. For example, oral administration vwould be expected to require higher dosages than 
administration by intravenous injection. Variations in these dosage levels can b adjusted using standard empirical rou- 
tines for optimization, as is well uriderstood In the art. . 

[0067] Polypeptides used in ti'eatmeht can also be generated ehdogenously in tiie subject, in treatment modalities' 
5 : often referred to as "gene therapy"* as descnlaed above. Thus, for example, cells from a subject may be engineered with 
a polynucleotide, such as a DNA or RNA, to encode a polypeptide ex vivo, and for example, by ttie use o1 a retroviral 
plasmid vector. The celis are then introduced into the subject. 

[0068] Polynucleotide and polypeptide sequences form a. valuable information resource witii which to ideritify further 
sequences of similar homology. This is most easily facilitated by storing the sequence iri a computer readable medium 

10 ' and then using the stored data to search a sequence database using well known searching tools, such as GCC. Accord- 
ingly, in a further aspect, the present invention provides for a computer readable medium having stored thereon a poly- 
nucleotide comprising the sequence of SEQ ID NO:1 and/or a polypeptide sequence encoded thereby. 
[0069] The following definitions are provided to fecllitate understanding of certain terms used frequently hereinbefore. 
[0070] "Antibodies" as used herein includes polyclonal and monodonal antibodies, chimeric, single chain, and 

15 humanized antibodies, as well as Fab fragments, induding the products of an Fab or other Immunoglobulin expression 
. library. . 

[0071] "Isolated" meansaltered Tay thehand of nrtan" from the natural state. If an "isolated" composition or substance 
occurs in nature, it has been changed or removed from its original environment, or both. For example, a polynucleotide 
or a polypeptide naturally present in a living animal is not Isolated," but the same polynucleotide or polypeptide sepa- 
20 ratiad from the coexisting materials of its natural state is "lsolated\ as the term is employed herein. 

[0072] "Polynudeotide" generally refers to any polyribonucleotide or polydeoocrlbonudeotlde, which nrtay be unmodi- 
fied RNA or DNA or modified RNA or DNA. "Polynudeotides" indude, without limitation, single- and double-stranded 
DNA, DNA that is a mixture of single^ and double-stranded regions, single- and double-stranded RNA. and RNA that is 
nriixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
ts strarKled or, more typically, double-stranded or a mixture of single- and double-stranded regions. In addition, "polynu- 
cleotide" refers to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The term "polynudeotide" 
also indudes DNAs or RNAs containing one or more modified bases and DNAs or RNAs with bacWt)ones modified for 
stability or for otiier reasons. "Modified" bases include, for example, tritylated bases and unusual bases such as inoslne. 
A variety of nnodifications may be made to DNA and RNA; thus, "polynucleotide'' embraces chemically. enzyrTiatically 
30 or metabolically modified forms of polynudeotides as typically found in nature, as well as the chemical forms of DNA 
and RNA characteristic of viruses and cells. "Polynucleotide" also embraces relatively short polynudeotides, often 
referred to as oligonucleotides. . . 

[0073] TbIypeptideVefers to any peptide or protein comprising two or more amino acids joined to each 0 

tide bonds or modified peptide bonds, i.e., peptide isosteres. Tolypeptide" refers to both short chains, commonly 

35 refen-ed to as peptides, oOgopeptides or oligomers, and to longer chains, generally referred ito as proteins. F^typeptides 
may contain amino acids other than the 20 gene-encoded amino acids. "Polypeptides" include amino add sequences 
nrxxltfi^ either by natural processes, such as post-translational processing, or by chemical modification techniques 
which are well known in the art. Such modrfications are well described in basic texts and in more detailed monographs. 
: as well as in a voluminous research literature. Modifications may occur anywhere in a polypeptide^ including the peptide 

do backbone, the amino add side-chains and the amino or carboxyl termini. K vyill be appreciated that the same type of 
nrodif ication may be present to the same or varying de^ees at several ates In a given polypeptide. Also, a given 
polypeptide may contain many types of rriodif Ications, Polypeptides may be branched as a result of ubiquitination. and 
they may be cydic. with or without branching Cyclic, branched and branched cyclic polypeptides may result from post- 
translation natural processes or may be made by synthetic methods. Modifications include acetylation, acylation, ADP- 

4s ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nudeotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attac^me^t of phosphoti- 
dylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, forma- 
tion of cystine, formation of pyfoglutamate, formylation, gamma-carboxylation, gtycosylation, GPI anchor formation, 
hydroxylation. iodination. methylation, myristoylatibn. oxidation, proteolytic processing, phosphorylation, prenylation. 

50 racemization; selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins such as arginylatlon, 
and ubiquitination (see, for instance. PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES. 2nd Ed., T. E. 
Creighton. W. H. Freeman and Company, New YorK 1993; WoW, F.. Post-translational Protein Modifications: Perspec- 
. lives and Prospects/ pgs. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson. 
Ed., Academic Press, New York, 1983; Seifteir et al., "Analysis for protein modifications and nonprotein cofactors", Meth 

55 Enzymol (1990) 182:626-646 and Rattan et ai.\ "Protein Syntiiesis: Post-ti^nslational Modifications and Aging", Ann 
■ /V/ Acad Sa (1992) 663:48-6'2):^ ' 
[0074] "Variant" refers to a polynucleotide or polypeptide that diff rs from a reference polynucleotide or polypeptide, 
,but retains essential properties. A typical variant of a polynud otide differs in nudeotide sequence from anotiier, refer- 
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enc polynucleotide. Changes in th nude tide sequence of the variant may or may not alter the amino acid sequence 
of a polypeptide encoded by the refference polynuclectida Nucleotide changes may result. in amino acid sutsstitutions. 
additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as discussed 
below. Atypical variant of a polypeptide differs in amino acid sequence from another, reference polypeptide. Generally, 
differences are limited so that th sequences of the reference polypeptide and the variant ar closely similar overall and, 
in many regions, identical. A variant and reference polypeptide may differ in amino acid sequence by one or more sub- 
stitutions, additions, deletions in any oonribination. A substituted or inserted amino acid residue may or may not be one 
encoded by the genetic code. A variant of a polynucleotide or polypeptide may be a naturally occurring such as an 
allelic variant, or it may be a variant that is not known to occur naturally Non-naturaily occurring variants of polynucle- 
otides and polypeptides n^y be made by mutagenesis techniques or by direct synthesis. 

[0075] "Identity" is a measure of the identity of nudeotide sequences or amino acid sequences. In general, the 
sequences are aligned so that the highest order match is obtained. "Identity" perse has an art-recognized meaning and 
can be calculated using published techniques (see, e.g.: COMPUTATIONAL MOLECULAR BIOLOGY, Lesk, A.M., ed., 
Oxford University Press, New York. 1988; BIOGOMPUTING: INFORMATICS AND GENOME PROJECTS, Smith, D.W., 
ed.. Academic Press, New York, 1993; COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, Griffin, A.M., and Grif- 
fin, H.G.. eds., Humana Press, New Jersey. 1994; SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, von Heinje. G., 
Academic Press. 1987; and SEQUENCE ANALYSIS PRIMER. GrSDskov, M. and Devereux. J., eds.. M Stockton Press. 
New York, 1991). While there exist a number of methods to measure identity between two polynucleotide or polypeptide 
sequences, the term "identity" is welt known to skilled artisans (Carillo, H., and Upton, D., SIAM J Applied Math (1988) 
48:1073). Metiiods commonly employed to determine identity or similarity between two sequences include, but are not 
limited to, those disclosed in Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and 
Carillo, H., and Lipton, D., SIAM J Applied Math (1988) 48:1073. Methods to determine identity and similarity are cod- 
ified in computer programs. Preferred computer program metiiods to determine identity and similarity between two 
sequences indude, but are not limited to, GCQ program package (Devereux, J,, et al.. Nucleic Acids Research (1984) 
12(1):387). IBLASTP, BLASTN, and FASTA (Atschul, S.F. ef a/., J Molec B/o/ (1990) 215:403). 
. [0076] By way Of example, a polynucleotide sequence of tiie present invention may be identical to tiie reference 
sequence of SEQ ID N0:1 , that is be 100% identical, or it may include up to a certain integer number of nucleotide alter- 
ations as compared to tiie reference sequence. Such alterations are selected from the group consisting of at least one 
nudeotide deletion, substitjtion, including transition and transversion, or insertion, and wherein said alterations may 
occur at ttie 5* or 3* terminal positions of the reference nudeotide sequence or anywhere between those terminal posi- 
tions, interspersed either individually among the nudeotides in tiie reference sequence or in one or more contiguous 
groups witiiin tiie reference sequence. The number of nudeotide alterations is determined by miitiplying the total 
number of nucleotides in SEQ ID NO:l by the numerical percent of tiie respective percerit identity(divided by 100) and 
subtracting tiiat product from said total number of nucleotides in SEQ ID NO:T. or: 

nn^Xn-(3^n*y)v 

wherein n^ is tiie nunft>er of nucleotide alterations, Xp is the total number of nudeotides in SEQ ID N0:1 , and y is 0.50 
for 50%, 0.60 for 60%. 0.70 for 70%, 0.80 for 80%. 0.85 for 85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 
1 00%, and wherein any non-integer product of and y Is rounded down to the nearest integer prior to subtiacting it 
from Xn- Alterations of a polynucleotide sequence encoding the polypeptide of SEQ ID N0:2 may create nonsense, mis- 
sense or frameshtft mutations in this coding sequence and tiiereby alter tiie polypeptide encoded by the polynucleotide 
following such alterations. 

[0077] Similarly, a polypeptide sequence of the present invention may be identical to the reference sequence of SEQ 
ID N0:2, tfiat is be 100% identical, or it may include up to a certain integer number of amino acid alterations as com- 
pared to tiie reference sequence such that ttie % identity is less than 100%. Such alterations are selected from ttie 
group consisting of at least one amino acid deletion, substitution, induding conservative and non-conservative substi- 
tution, or insertion, and wherein said alterations may occur at tiie amino- or carboxy-terminal positions of tiie reference 
polypeptide sequence or anywhere between those terminal positions, interspersed eitiier individually among the amino 
acids in tiie reference sequence or in one or more contiguous groups within the reference sequence. The number of 
amino acid alterations for a given % identity is determined by multiplying the total number of amino acids in SEQ ID 
N0:2 by the numerical percent of ttie respective percent identity(divided by 100) and ttien subtracting that product from 
said total number of amino acids in SEQ ID NO :2, or: 
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wherein is the number of amino acid alterations, is the total number of amino acids in SEQ ID N0:2» and y is, for 
instance 0.70 for 70%. 0.80 for 80%, 0.85 for 86% etc.. and wherein any non-integer, product f Xa and y is r unded 
down to the nearest integer prior t subtracting it fr m Xg. 

[0078] . "Fusion protein" refers to a protein encoded by two, oft n unrelated, fused genes or fragments thereof. In one 
5 example, EP-A-0 464 dsdps s fusion proteins comprising various portions of constant region of immunoglobulin mol- 
ecules together with another human protein or part thereof. In many cases, erriploying an immunoglobulin Fc region as 
a part of a fusion protejn is advantageous for use in therapy and diagnosis resulting in, for exanrple, improved pharma- 
cokinetic properties [see. e.g.. EP-A 0232 262]. On the other hand, for some uses it would be desirable to be aUe to 
delete the Fc part after the fusion protein has been expressed, detected arid purified. 
10 [0079] All publications, iriciuding but not limited to patents and patent applications, cited in this specification are herein 
incorporated by reference as if each individual publication were specifically and individually indicated to be incorporated 
by reference herein as though fully set forth.. 
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SEQUENCE INFORMATION 
SEQIDNO:! 

CTCGCCCTGCTCCCATGGCCGGCCCCGGCTCCCCGCGCTCCCCCCTTTACCCCGGGCaSCGC 

ACTXSACGGCCCATGGGGCCGCCAGCCGCCCGCCTCGCCCtGCTCTCCGCOSCGGCGCrC^ 

CCGCGCCTAGCCCCGGCCTCGGCCCCGGACCCGAGTGTTTXyiCAGCCAATGGTCCGC^ 

ACTGGACAGCACTACAAGGCGGGAAGCCATGTCTGTTTTGGAACGAGACTTTCCAGCATCCATACi^ 

AATACCCCAACGGTAGTGGGGGCCTGGGTGAGCACAACTATTGCAGAAATCCAGATGGAG 

ACGTGAGCCCCTGGTGCTATGTGGaGAGCACGAGGATOGTGTCTACTGGAAGTACTGTGAGATACCTGCTTGC 

AGATGCCTGGAAACCTTGGCTGCTACAAGGATCATGGAAACCCACCTCCTCTAACTGGCACCAGTA 

ACAAACTCACCATACAAAGTTGCATCAGTTTTTGTCGGAGTCAGAGGTTCAAGTTTGCTGGGATGGAGTC^ 

ATGCTTGCTTCTGTGGAAACAATCCTGATTACTGGAAGTACGGGGAGGCAGCCAGTACCGAATGCAACAGCGTCT 

GCTTCGGGGATCACACCCAACCCTGTGGTGGCGATGGCAGGATCATCCTCTTTGATACT^ 

TCGTGGGCGCGTGCGGTGGGAACTACTCJ^GCCJlTGTCTTCTGTGGTCTATTCCCCTGAcfT.CC 

CCACGGGGAGGGTCTGCTACTGGACCATCCGGGTTCCGGGGGCCTCCCAPVTCCACTTCAGCTTCCCCCTATTTG 

ACATCAGGGACTCGGCGGACATGGTGGAGCTTCTGGATCGCTACACCCACCGTGTCCTAGCCCGGTTCCACGGGA 

GGAGCCGCCCACCrCTGTCCTTCAACGTCTCTCTGGACTTCGTCATCTTGTATTTCTTCTCTGA 

AaOCCCAQGaATTTQCTGTTTTATACCAAGCCGTCAAaGAAOAACyrOCCACAGGAGAOaC 

CCGCTOTCAACCAGACGGTGGCCGAGGTGATCACXKSAGCAGGCCAACCTCAGTGTCAGCGCT^ 

AAGTCCTCTATGTCATCACCACCAGCCCCAGCCACCCACCTCAAACTGTCCCAGAATGaACAGTCTAT^ 

CAACTCTCCTCATCCTCACAGTCACAGCCATTGTAGCAAAGATACTTCTGCACGTCACATTCAAATCCCATC^ 

TTCCTGCTTCAGGGGACCTTAGGGATTGTCATCAACCAGGGACTTCGGGGGAAATCTCGAGCyVTT^ 

CTTCCACTTCAATTTCCATCTTTAAGAAGAAACTCAAGGGTCAGAGTCAAGAAGATGACC 

GCAATCCCCTTGTGAGTGACTAAAAACCCCACTGTGCCTAGGACTTGAGGTCCCTCTTTGAGCTCAAGGCTGCCG 
TGGTGAACCTCTCCTGTGGTTCTTCTCTGACAGACTCTTCCCCTCCTCTCCCTCTGCCTCGGCTCTTCGGGGAAA 
CCTCCTCCTACAGACTAGGAAGAGGCACCCTGCTGCCAGGGCAGGCAGAGCTGGATTCCTCCTGCTT'CA^ 
GCACTTAGGAGAGAGACTCAAAGCCCTGGGCCCGGCCCTCTCTGCATCTCTCTCTGATCTAGCTAGCAGTGGGGG 

tgtcaggacagtgaggctgagatgacaIgaggtggtcatggctggcacagggctcaggtac 

ATTCTAGATGGCTGTCAGGTGGTGGGTAGCTTTAGTTACATTGAATTTTTCTTGC^ 
SEQroNO:2 

MAPPAARLALLSAAALTLAARPAPSPQLQPaPECPTANOADYRGTQNWTALQGGKPCLFWNET^^ 

GSGGLGBHNYCRNPDGDVSPWCyVAEHEDGVYWKyCEIPACQMPGKLGCYKDHGNPPPLTGTSKTSNKLTIOT 

SFCllSQRFKFAGMESGYACFCGNNPDyWKYGEAASTECNSVCFGDHTQPCGGDGRIILFDtLVGACGGNYSAMSS 

VVYSPDFPDTYATGRVCYWTIRVPGASHIHFSFPLFDIRDSADMVELLDGYTHRVLARFHGRSRPPLSFNVSLDF 

V I LYFFSDRINQAQGFAVLYQAVKEELPQERPAVNQTVABVITSQANIiS VS AARSSKVLY 

VirrSPSKPPQTVPEWT\nfGLATTjLILTVTAIVAKILLHVTFKSHRVPASGDLRDCHQPG 

I S I FKKKLKGQSQQDDRWPLVSD 

SEQIDNO:3 

GGCACXSAGCTGGACAGCACrrACAAGGCGGGAAGCCATQTCTGTTTTGGJUlCGAGAC^^ 

ACrCTGAAATACCCCAACGGTAGTGGGGGCCTGGGTGAGCACAACTATTGCAGAAATCCAGATGGAGACGTGAGC 
CCCTGGTGCTATGTGGCAGAGCACGAGGATGGTGTCTACTGGMGTACTGTQAGATACCTGCTTGCCAGATGCCT 
^GGAAACCTTGGCTGCTACAAGGATCATGGAAACCCACCTCCTGTAACTGGC^^ 
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ACCATACAAAGTTGCATCAGTlTTTGTCGGAGTCAGAGGTTCAAGTTTGCTGaGAT^ 

ttctgtggaaacaatcctgattactcgMgtacggggaggcaggcagtaccgaatggaacagcgtgtggttc 

GATCACACGCAACCCTGTGGTGGCGATGGCAGGATCATCCTCTTTGATACTCTCGTGGGCGCCT^ 

TACTCAGCCATGTCTTCTGtGGTCTATTCCCGTGACtTCCCaSACACCtATGCCACGGGGAGGGTd 

ACCATCCGGGTTCCGGGGGCCTCCCACATCCACTTCAGCTTCCCGGTATTTGACATCAGGGAGTCGGCGGACATG 

GTGGAGCln'GTGGATGGCTACACCCACCGTGTCGTAGCCCGCTTCCACGGGAGGAGCCGCCCA^ 

AACGTCTCTCTGGACTTGGTCATCTTGTATTTCTTCtCTGATCGCATCAATCAGGCCCAGQGATTTC 

TACCAAGCCGtGAAGGAAGAACtGCCACAGGAGAGGCCCGCTGTCAAC CAGACGGTGGCCGAGGTGGATCACGGA 

GCAGGCCAACCTTCAGTGlTCAGCGCTTCCCGGTCTCCTCAAAGTCCTATTATG'rcCATGCACCA 

SEQIDNO:4 

GTSWTALQGG KPCLFWNETF OHPYNTLKYP NGSGGLGBHN YCRNPDGDVS PWCYVAEHED 

GVYWKYCEIP ACQMPGNIiGC YKDHGNPPPIi TGTIKTSNKL TIQTCISFCR SQRFKFAGME . ' 

SGYAGFCGNN PDYVfKYGEAA STEGNSVCFG. DHTQPCGGDG RIILFDTLVG ACGGNYSAMS ^ 

SWYSPDFPD TYATGRVCYW TIRVPGASHI HFSFPLFDIR DSADMVELLD GYTHRVLARF 

- HGRSRPPLSF NVSLDFVILY FFSDRINQAQ GFAVLYQAVK EELPQERPAV NQTVAEVDHG 

AGQPSVFSAS RSPQSPIMCM H 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT 

(A) NAME: .SMITHKLINE BEECHAM CORPORATION 
, (B) STREET: ONE FRANKLIN PLAZA 

(C) CITY: PHILADELPHIA 

(D) STATE OR' PROVINCE: PENNSYLVANIA . 

(E) . COUNTRY: USA 

(F) ,POSTAL CODE: 19103 

(ii) TITLE OF THE INVENTION: A KRINGLE- RELATED CLONE, ■ 

'HrHBZ47 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER-READABLE FORM: 

{A) MEDIUM TYPE: Diskette * 

(E) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS ' 

(D) SOFTWARE: FastSEQ. for Windows Version 2.0 

(V) CURRENT APPLICATION DATA: 

\A] 'APPLICATION NUMBER: TO BE ASSIGNED 

(2) INFORMATION FOR SEQ ID N0:1: 

■ (i.) SEQUENCE CHARACTERISTICS: 
iA) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . 

(D) TOPOLOGY: linear 

(Li) MOLECULE TYPE: cDNA 

(xi) SEQUEt^CE DESCRIPTION: SEQ ID N0:1: ' - 

CTCGCCCTGC TCCCATGGCC GCCCCCGGCT CCCCGCGCTG CCCCCTTTAC CCCGGGCCGC 

GCCCCGGGGC CCCGCACTGA CGGCCCATGG CGCCGCCAGC CGCCCGCCTC GCCCTGCTCT 

CCGCCGCGGC GCTCACGCTG GCGGCCCGGC CCGCGCCTAG CCCCGG.CCTC GGCCCCGGAC 

CCGAGTGTTT CACAGCCAAT GGTGCGGATT ATAGGGGAAC ACAGAACTGG ACAGCACTAC 
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15 



20 



30 



35 



46 



50 



55 



AAGGCGGGAA 


GCCATGTGTG 


TTTTGGAACG AGACTTtCCA GCATCCATAC 


AACACTCTGA 


300 


AATACCCCAA 


CGGTAGTGGG 


GGCCTGGGTG 


AGCACAACTA 


ttgcagaaat 


CCAGATGGAG 


360 


ACGTGAGCCC 


CTGGTGCTAT" 


GTGGCAGAGC 


ACGAGGATGG 


TGTCTACTGG 


AAGTACTGTG 


420 


AGATACCTGCt 


TTGCCAGATG 


eCTGGAAACC 


TTGGCTGCTA 


CAAGGATCAT 


GGAWVCCCAC 


480 


CTCGTCTAAC 


TGGCACCAGT 


AAAACGTCCA ACAAACTCAC 


CATACAAACT 


TGCATCAGTT 


540 


TTTGTCGGAG 


■TCAGAGGTTG 


AAGTTTGCTG 


GGATGGAGTC ^ AGGCTATGCT 


TGCTTCTGTG 


- 600 


GAAACAATCC 


TGATTACtGG 


AAGTACGGGG 


AGGCAGCCAG 


TACCGAATGC 


.AACAGCGTCT 


660 


GCTTCGGGGA 


TCACACCCAA 


CCCTGTGGTG 


GCGATGGCAG GATCATCCTC 


TTTGATACTC ' 


720 


TCGTGGGCGC 


GTGCGGTGGG 


AACTACTCAG 


CCATGTCTTC TGTGGTCTAT. 


TCCCGTGACT 


780 


TCCCCGACAC 


CTATGCCACG 


GGGAGGGTCT 


GCTACTGGAC 


CATCCGGGTT 


CCGGGGGCCT 


* 840 


CCCACATCCA 


CTTCAGCTTC 


CCCCTATTTG 


ACATCAGGGA 


CTCGGCGGAC 


ATGGTGGAGC 


90O 


TTCTGGATGG 


CTACACCCAC 


CGTGTCCTAG 


CCCGCTTCCA 


CGGGAGGAGC 


CGCCCACCTC 


960 


TGTCCTTCAA CGTGTCTCTG 


GACTTCGTCA 


TCTTGTATTT 


CTTCTTCTGAT 


CGCATCAATC . 


1020 


AGGGCCAGGG atttgctgtt 


TTATACCAAG' CCGTCAAGGA/ AGAACTGeCA 


CAGGAGAGGC 


1080 


ccgctgtcaa 


CCAGACGGTG 


GCCGAGGTGA 


TCACGGAGCA 


GGCCAACCTC 


AGTGTCAGCG 


1140 


ctgcccggtc 


CTCCAAAGTC 


CrCTATGTCA 


TCACCACCAG 


CCCCAGCCAC 


CCACCTCAAA 


1200 


ctgtcccaga 


ATGGACAGTC 


TATGGTCTGG 


CAACTCTCCT 


CATCCTCACA 


GTCACAGCCA . 


126Q 


TTGTAGCAAA 


GATACTTCTG 


CACGTCACAT 


TCAAATCCCA 


TCGTGTTCCT 


GCTTCAGGGG .. 


1320 


ACCTTAGGGA -TTGTCATCAA 


CCAGGGACTT 


CGGGGGAAAT 


CTGGAGCATT 


TTTTACAAGC 


M380 


CTTCCACTTC AATTTCCATC 


TTTAAGAAGA .AACTCAAGGG 


TCAGAGTGAA 


CAAGATGACC 


1440 


GCAATCCCCT 


TGTGAGTGAC 


TAAAAACCCC 


ACTGTGCCTA 


GGACTTGAGG 


TCCCTCTTTG 


1500 


AGCTCAAGGC 


TGCCGTGGTC 


.AACCTCTCCT 


GTGGTTCTTC 


TCTGACAGAC 


TCTTCCCCTC ' 


1560 


CTCTCCCTCT 


GCCTCGGCTC 


TTCGGGGAAA 


CCTCCTCCTA 


CAGACTAGGA 


AGAGGCACGC 


^\1620 


TGCTGCGAGG 


GCAGGCAGAG 


CTGGATTCCT 


CCTGCTTCAT 


CGATTGCACT 


TAGGAGAGAG 


1680 


ACTCAAAGCC 


GTGGGCCCGG 


CCCTCTCTGq ATCTCTCTCT GATCTAGCTA 


GCAGTGGGGG 


1740 


TGTCAGGACA 


GTGAGGCTGA 


GATGACAGAG 


GTGGTCATGG 


CTGGCACAGG 


GCTCAGGTAC 


1800 


ATTCTAGATG 


GCTGTCAGGT 


GGTGGGTAGC 


TTTAGTTACA 


TTGAATTTTT 


CTTGCTTCTC 


1860 


TATTTTTGTC 


CACAG . 










1875 



(21 INFORMATION FOR SEQ rD^N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 458 amino acids ■ ' . 

(B) TYPE: amino acid , ' 

(C) STRANDEDNESS: single ^ ' 

(D) TOPOLOGX: linear . v . 

(ii) E40LECULE^ TYPE:; protein ' ' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Ala Pro Pro Ala Ala Arg Leu Ala Leu Leu Ser Ala Ala Ala Leu 

1 ^ • ^ 5' ■ /.^ ^10 " 15 



16 



EP0 911 399 A2 



thr Leu Ala Ala Arg -Pro Ala Pro Ser Pro Giy Leu Gly Pro Gly Pro 

'' ■ ' ' ■ 20 ' . ' 25 30 

S ■ ' ^^r^ Phe Thr Ala Asn Gly Ala Asp Tyr Arg Gly thir Gin Asn Trp 

' 35 40 ' ■ 45 

Thr Ala 'Leu Gin Gly. Gly Lys Pre Cys Leu Phe Trp Asn Glu Thr Phe 

• 50 ' 55 ; , ■ 60 

Gin His Pro Tyr Asn Thr Leu Lys Tyr Pro Asn Gly Ser Gly Gly Leu 

65 . ' 70 . . 75 ' 80 

' Gly Giu His Asn Tyr Cys Arg /Asn' Pro Asp Giy Asp Val ;Ser Pro Trp 

/ ^5 • ' . ' " 90 95 - 

Cys Tyr Val Ala Glu His Glu .Asp Gly Val Tyr Trp. Lys Tyr Cys Glu 
: ' ^ ' 100 " ' ' 105 110 

. ' lie Pro Ala Cys ; Gin Met Pro' Gly Asn Leu' Giy/Cys tyr Lys Asp His 
' . , - ■ ; / ■ . 120 ^ 125\ 

Gly Asn Pro Pro Pro Leu Thr Gly Thr Ser Lys thr Ser Asn Lys Leu 
• . . • * : " 140 - 

Thr He Gln Thr Gys lie Ser Phe Cys Arg Ser Gin Arg Phe. Lys Phe ■ 
145 ; 150. 155 . 160 

. Ala Gly Met Glu Ser Gly Tyr Ala Cys Phe Cys Gly Asn Asn Pro Asp 
25 ^ 165 ■ 170 • ' 175 

Tyr Trp Lys Tyr Gly Glu , Ala Ala Ser . Thr Glu-Cys Asn Ser Val Cys 
180 185 ' 190 

, . Phe Gly, Asp His thr Gin Pro Cys Gly Gly Asp Gly Arg- He He Leu 

30 - . 195; : - . 200 ; ' 205 

Phe Asp Thr Leu Val Gly Ala Cys Gly Giy Asn Tyr Ser Ala Met Ser 
. ' \ 2ltl . s • 215 220^ 

Ser Val Val Tyr Ser Pro Asp . Phe Pro Asp >thr Tyr Ala Thr Gly Arg 

' . . ' 225 230 235 240 

35 .... / ■ . . . 

Val Cys Tyr Trp Thr He Arg Val Pro Gly Ala Ser His He His Phe 

, ■ 245 . ** 250 255 

' Ser Fhe Pro Leu Phe Asp He Arg Asp Ser , Ala Asp Met Val Glu Leu 

. - 260 ■'' 265 270 

40 

Leu Asp Gly Tyr Thr His Arg Val Leu Ala Arg Phe His Gly Arg Ser 

275, . 280 ' 285 

Arg Pro Pro Leu Ser Phe Asn Val Ser Leu Asp Phe Val lie Leu Tyr 

290 • • 295 . ' ' 300 

Phe Phe Ser Asp Arg He Asn Gin Ala Gin Gly Phe Ala Val Leu Tyr 

305 ' 310 ' 315 320 

Glri.Ala Val Lys Glu Glu Leu Pro Gin Glu Arg Pro Ala Val Asn Gin 

325 330 : 335 

50 Thr Val Ala Glu Val He Thr , Glu Gin Ala Asn Leu Ser . Val Ser Ala 

/ 340 ' 345 350 

Ala Arg Ser Ser Lys Val Leu Tyr Val lie Thr Thr Ser Pro Ser His 

55'^ * 
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365 . 
thr Val Tyr Gly Leu Ala Thr Leu 

380 ••" 

Vai Ala Lys lie Leu , Leu His Val 
' 395 400 
Ala Ser Gly Asp Leu Arg Asp Cys 

'410 415 
lie Trp Ser lie Phe Tyr Lys Pro 
425 ■430 
Lys Lys Leu Lys Gly Gin Ser Gin 
445' 

Ser Asp 

(2). INFOriMATION FOR SEQ ID NO: 3: 

20 . . ' ■ 

* (i)* SEQUENCE CHAEIACTERISTICS: 

(A) LENGTH: 965 base pairs ' V" 
(3) TYPE:, nucleic acid . , , * 

25 - (C) STRANDEDNESS: single 

'{D) TOPOLOGY :^linear ■ - ' . ' 

(iil MOLECULE TYPE: cDNA 
30 ' * 



35 



'40 



45 



50 



■ (xi) 


SEQUENCE DESCRIPTION: 


SEQ ID 'N0:3: 






GGGACGAGCT 


GGACAGCACT 


ACAAGGCGGG 


AAGCCATGTC TGTTTTGGAA 


CGAGACTTTC 


,60. 


CAGCATCGAT 


ACAACACTCT 


GAAATACCGC 


AACGGTAGTG GGGGCCTGGG 


TGAGCACAAC 


, 120 


TATTGCAGAA 


■ ATCCAGATGG 


AGACGtGAGC CCCTGGTGCT ATGTGGCAGA 


GCACGAGGAT 


180 


GGTGTGTACT 


GGAAGTACTG. 


TGAGATAGCT 


GCTTGCCAGA TGCGTGGAAA- 


GCTTGGCTGC 


240 


TACAAGGATC 


ATGGAAACCC 


ACCTCCTCTA 


ACTGGCACCA TAAAAACGTC 


CAACAAACTC 


300 


ACCATACAAA 


CTTGCATCAG 


TTTTTGTCGG 


AGTCAGAGGT TCAAGTTTGC 


TGGGATGGAG 


360 


TCAGGCTATG 


CTTGCTTCTG 


TGGAAACAAT 


CCTGATTACT GGAAGTACGG GGAGGCAGCC 


4 20 


AGTACCGAAT 


GCAACAGCGT 


CTGCTTCGGG . GATCACACCC AACCCTGTGG 


TGGCGATGGC 


480 


AGGATCATCC 


TCTTTGATAC. 


TCTCGTGGGC 


GCCTGCGGTG GGAACTACTC 


AGCCATGTCT-. 


, 540, 


TCTGTGGTCT ' ATTCCCCTGA 


CTTCCCCGAC 


ACCTATGCCA CGGGGAGGGT 


CTGCTACTGG 


600 


ACCATCCGGG TTCCGGGGGC 


CTCCCACATC 


GACTTCAGCT TCCCCCTATT 


TGACATCAGG 


660 


GACTCGGCGG 


ACATGGTGGA" 


GCTTCTGGAT 


GGCTACACCC ACCGTGTCCT 


AGCCCGGTTC 


• 720 


CACGGGAGGA 


GCCGCCCACC 


TCTGTCCTTC 


AACGTCTCTC TGGACTTCGT 


CATCTTGTAT 


780 


TTCTTCTCTG 


ATCGCATCAA 


TCAGGCCCAG 


GGATTTGCTG TTTTATACCA 


AGCCGTCAAG 


840 


GAAGAACTGC 


CACAGGAGAG 


GCCCGCTGTC AACCAGACGG TGGCCGAGGT - 


GGATCACGGA . 


900 


GCAGGCCAAC XTTCAGTGTT 


CAGCGGTTCC 


CGGTCTCCTC AAAGTCCTAT 


TATGTGCATG 


960 


cacca' 










965 



- 355 ; 360 

Pro .Pro Gin Thr Val Pro Glu Trp, 

, 370 , ' '- 375' : ' 

Leu ' lie Leu Thr Vai. Thr Ala He 
385 ^ 390 

Thr Phe Lys Ser His Arg Vai* Pro 

. ■ ' ' • 405 ' ; ' ' 

His Gin Pro Gly Thr Ser 'Gly Glu 

... ^ 420 ■ - 

* Ser Thr Ser He Ser He Phe Lys 

4 35 4 40' 

GLn Asp Asp Aug Asn Pro Leu Val 
; 450 ' / 455 
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(2) INrORMATION FOR' SEQ ID,N0:4:., . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .^321 amino acids 

(B) TYPE: amino acid . . • 

(C) STRANDEDNESS: single * \ 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: , protein 

(Xi) SEQUENCE DESCRIPTION: SEQIDNO:4: 

Gly ?hr Ser Trp Thr Ala Leu Gin Gly Gly Lys Pro Cys Leu'Phe Trp 

1 . 5 . 10 ' : ■ 15 . 

Asn Glu Thr Phe Gin His, Pro Tyr Asn Thr Leu Lys Tyr Pro Asn Gly 

■ 20 25 30 

Ser Gly Gly Leu Gly Glu His Asn Tyr Cys Arg Asn Pro Asp Gly Asp 

35 . ' ' 40 . 45 

Val Ser Pro Trp Cys Tyr Val Ala Glu His Glu Asp Gly Val tyr Trp 

- 50 , , ■ .55 ' \ ... 60 

Lys Tyr Cys Glu lie Pro Ala Cys Gin Met Pro Gly Asn Leu Gly Cys 
65 70 .75 80 

Tyr Lys Asp His Gly Ash Pro Pro Pro Leu Thr Gly Thr lie Lys Thr 

'85 90 ' 95, 

Ser Asn Lys Leu Thr lie Gin Thr Cys lie Ser Phe Cys Arg Ser Gin. 

: . 100 \ 105 110 

Arg Phe^ Lys Phe Ala Gly Met Glu Ser Gly Tyr Ala Cys Phe Cys Gly 

us' , ■ 120 . , 125 • 

Asn Asn Pro Asp Tyr Trp Lys Tyr Gly Glu Ala Ala Ser Thr Glu Cys 

130 ' ^' ''135 ' 140 

Asn. Ser Val Cys Phe Gly Asp His Thr Gin Pro Cys Gly Gly Asp Gly 
145 150 155 160 

Arg He lie Leu Phe Asp Thr Leu Val Gly Ala Cys Gly Gly Asn Tyr 

165 . 170 17 5 

Ser Ala Met Ser Ser Val Val Tyr' Ser Pro Asp Phe Pro Asp Thr Tyr 

180 185 190 

Ala Thr Gly Arg Val Cys Tyr Trp Thr lie Arg Val Pro Gly Ala Ser 

195 ' * 200, _ 205 

His He His Phe Ser Phe Pro" Leu Phe Asp ,Ile Arg Asp Ser Ala Asp 

210- * ' 215 * 220 

Met Val Glu Leu Leu Asp Gly tyr Thr His Arg, Val Leu Ala. Arg 'Phe 
225 ' 230 * 235 ' . ' 240 
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His Gly hrq Ser Arg Pro Pro Leu Ser Phe Asn Val Ser Leu Asp Phe 

'245 250 255 

Val lie Leu Tyr Phe Phe Ser Asp Arg He Asn Gin Ala Gin Gly Phe 

260 265 ^ 270 

Ala' Val Leu Tyr Gin Ala Val Lys GluGlu Leu Pro Gin Glu Arg Pro 

275 ' 290 285 

Ala Val Asn Gin Thr Val Ala Glu Val Asp His Gly Ala Gly Gin Pro 

290 295 ' ' 300 

Ser Val Phe Ser Ala Ser Arg Ser Pro Gin Ser Pro lie Met Cys Met 
305 . 310 ' 315 320 

His ' 



20 Claims 

1. An i&olated polypeptide selected from the group consisting of: 

(i) an isolated polypeptide comprising an amino acid sequence selected from the group having at least: 

25 

(a) 70% identity; 

(b) 80% identity: 

(c) 90% identity; or 

(d) 95% identity 

30 • " . 

to the annino acid.sequence of SEQ ID N0:2 over the entire length of SEQ ID N0:2; 
(li) an isolated polypeptide comprising the amino acid sequence of SEQ ID N0:2 or 
(iii) an isolated polypeptide which is the amino acid sequence of SEQ ID Nd:2. 

35 2. An isolated polynucleotide selected from the group consisting of: 

(1) an isolated polynucleotide connprising a nucleotide sequence encoding a polypeptide that has at least 

(a) 70% identity; 
40 (b) 80% identity; 

(c) 90% identity: or 

(d) 95% identity; . , 

to the annino add sequence of SEQ ID NO:2, over the entire length of SEQ ID N0:2; 
-^s (ii) an isolated polynucleotide corrprising a nucleotide sequence that has at least: 

(a) 70% identity 

(b) 80% identity; 

(c) 90% identity; or 
50 (d) 95% identity; . 

over its entire length to a nucleotide sequerice encoding the polypeptide of SEQ ID N0:2; 
. (iii) an isolated polynucleotide comprising a nucleotide sequence which has at least: 

55 (a) 70% identity; 

(b) 80% identity; . 

(c) 90% identity; or 

(d) 95% identity; 
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, to that of SEQ ID NO: lover the entire length of SEQ ID N0:1 ; 

(iv) an isolated polynucleotide comprising a nucleotide sequence encoding the polypeptide of SEQ ID N0:2; 
(vi).ah isolated polynucleotide which is the polynucleotida of SEQ ID NO: 1 ; or . 

(vi) an isolated polynucleotide obtainable by screening an appropriate iSsrary under stringent hybridization con- 
, ditions with a lateled probe having the sequence of SEQ ID NO: 1 or a fragment thereof.: 
or a.nucleotide sequence complementary to said isolated polynudeotide. ^-s - 

3. An antibody immunospecrfic for the polypeptide of claim 1. 

4. A method for the treatment of a subject: 

(i) in need of enhanced activity or expression ot the polypeptide of claim 1 comprising: 

(a) administering to the subject a therapeutcally effective amount ipf an agonist to said polypeptide; and/or 

(b) providing to the subject an isolated polynucleotide comprising a nucleotide sequence encoding said 
polypetide in a form so as to effect production of said polypeptide activity in Ww.; or 

(ii) having need to inhibit activity or expression of the polypeptide . 

(a) administering to the subject a therapeutically effective amount of an antagonist to said polypeptide; 
and/or / 

(b) administering to the subject a nucleic acid molecule that inhibits; the epcpression of a nucleotide 
sequence encoding said polypeptide; and/or 

(c} administering to the subject a therapeutically effective amount of a polypeptide that competes with said 
polypeptide for its iigand, substrate, or receptor. 

5. A process for diagnosing a disease or a susceptibility to a disease in a subject related to expression or activity of 
the polypeptide of daim 1 in a subject comprising: 

(a) determining the presence or absence of a mutation in the nucleotide sequence encoding said polypeptide 
in the genome of said subject; and/or 

(b) analyzing for the presence or amount of said polypeptide expression in a sample derived from said subject. 

6,. A method for screening to identify compounds which stimulate or which inhibit the fiiriction of the polypeptide of 
claim 1 which comprises a method selected from the group consisting of : 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or mennbranes bearing 
the polypeptide] or a fuision protein thereof by means of a label directly or indirectly associated with the candi- 
date compound; v ' ■ 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or membranes bearing 
the polypeptide] or a fusion protein thereof in the presence of a labeled competitor; 

(c) testing whether the candidate compound results in a signal generated by activation or inhibition of the 
polypeptide, using detection systems appropriate to the cells or cell membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing a polypeptide of claim 1 , to form a mixture, rneas- 
uring activity of the polypeptide in the mixture, and comparing the activity of the mixture to a standard; or 

(e) detecting the effect of a candidate compound on the production of mRNA encoding said polypetide and 
said polypeptide in cells, using for instance, an ELI S A assay 

7. An agonist or an antagonist of the polypeptide of clairh 1 . 

8. Ah e)q3ression system conrprising a polynucleotide capable of producing a polypeptide of claim 1 when said 
> expression system is presertt in a compatible host cell. 

9. A host cell comprising the expression systeni of claim 8 or a membrane thereof expressincj the jDolypeptide of claim 

\: ' ■ ■ , _ ' 

10. A process for producing a polypeptide of claim 1 conprising culturing a host cell of daim 9 under conditions suffi- 
dent for. the production of said polypeptid arid recovering ih^ 
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11. A process for producing a host cell as defined in daim 9 comprising transforming or transfecting a cell with the 
expression system f claim 8 such that the host cell, under appropriate culture conditions, produces a polypeptide * 
of claim 1. ' 

s 12. A recombinant host ceil produced by the process of claim 11 or a mernbrane thereof expressing a polypeptide of 
daim 1.' ; , / ^ 

13. An isolated polyr^icleotide selected form the group consisting of: • 

10 (a) an isolated polynucleotide comprising a nudeotide sequence which has at least 70%. 80%, 90%, 95%. 

97% identity to SEQ ID N0:3 over the entire length of SEQ ID N05; ^ 

(b) ari isolated polynudeotide comprising the polynudeotide of SEQ ID N0:3; 

(c) the polynudeotide of SEQ ID N05: or 

(d) an isolated polynudeotide comprising a nudeotide sequence encoding rpoiypep*^ which has at least 
75 70%. 80%, 90%. 95%. 97-99% identity tothe anrano add sequence of SEQ ID N0:4, wer the entire length of 

SEQIDN0:4. 

■ . • " ■ . . ■ " ' ' V . ■ ' ' V ■ ■ f " 

14. A polypeptide selected from the group consisting Of: 

20 (a) a polypeptide which comprises an amino add sequence which has at least 70%. 80%. 90%, 95%, 97-99% 

identity to that of SEQ ID N0:4 over the entire length of SEQ ID N0:4; 

(b) a polypeptide which has an amino add sequence which is at least 70%, 80%, 90%. 95%, 97-99% identity 
to the amino add sequence of SEQ ID N0:4 over the entire length of SEQ ID N0:4: 

(c) a polypeptide which comprises the amino add of SEQ ID N0:4; 
25 (d) a polypeptide which is the polypeptide of SEQ ID NI0:4; 

(e) a polypeptide which is encoded by a polynucleotide comprising the sequence contained in SEQ ID Nd:3. 
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